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Genes identified in Asian SLE GWASs are also 
associated with SLE in Caucasian populations 

Chuan Wang 1 , Annika Ahlford 1 ' 16 , Tiina M Jarvinen 2 ' 17 , Gunnel Nordmark 3 , Maija-Leena Eloranta 3 , 

Iva Gunnarsson 4 , Elisabet Svenungsson 4 , Leonid Padyukov 4 , Gunnar Sturfelt 5 , Andreas Jonsen 5 , 

Anders A Bengtsson 5 , Lennart Truedsson 6 , Catharina Eriksson 7 , Solbritt Rantapaa-Dahlqvist 8 , Christopher 

Sjowall 9 , Heikki Julkunen 10 , Lindsey A Criswell 11 , Robert R Graham 12 , Timothy W Behrens 12 , Juha Kere 2 ' 13 ' 14 , 

Lars Ronnblom 3 , Ann-Christine Syvanen 1 and Johanna K Sandling*' 1 ' 15 

Recent genome-wide association studies (GWASs) conducted in Asian populations have identified novel risk loci for systemic 
lupus erythematosus (SLE). Here, we genotyped 10 single-nucleotide polymorphisms (SNPs) in eight such loci and investigated 
their disease associations in three independent Caucasian SLE case-control cohorts recruited from Sweden, Finland and the 
United States. The disease associations of the SNPs in ETS1, IKZF1, LRRC18-WDFY4, RASGRP3, SLC15A4, TNIP1 and 
16pll.2 were replicated, whereas no solid evidence of association was observed for the 7qll.23 locus in the Caucasian 
cohorts. SLC15A4 was significantly associated with renal involvement in SLE. The association of TNIP1 was more pronounced 
in SLE patients with renal and immunological disorder, which is corroborated by two previous studies in Asian cohorts. The 
effects of all the associated SNPs, either conferring risk for or being protective against SLE, were in the same direction in 
Caucasians and Asians. The magnitudes of the allelic effects for most of the SNPs were also comparable across different ethnic 
groups. On the contrary, remarkable differences in allele frequencies between Caucasian and Asian populations were observed 
for all associated SNPs. In conclusion, most of the novel SLE risk loci identified by GWASs in Asian populations were also 
associated with SLE in Caucasian populations. We observed both similarities and differences with respect to the effect sizes 
and risk allele frequencies across ethnicities. 
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INTRODUCTION 

Systemic lupus erythematosus (SLE) is an autoimmune disorder with 
a heterogeneous clinical spectrum, extending from mild cases limited 
to skin and joint manifestations to life-threatening conditions with 
renal impairment, severe cytopenias, central nervous system disease or 
thromboembolic events. The contribution of heritable factors to SLE 
predisposition is evident from the strong familial aggregation and 
high disease concordance in monozygotic twins (24-56%).' Since 
2008, >25 potential risk loci for SLE have been identified in genome- 
wide association studies (GWASs) in Caucasian populations. 2-6 These 
risk alleles account for about 8-15% of the total genetic susceptibility 
to the disease. 7,8 

Shortly after the findings of risk alleles for SLE in Caucasians, 
GWASs with similar or larger sample sizes were conducted in East 



Asian populations. 9,10 These studies not only confirmed the 
association signals for some of the major genetic loci reported in 
Caucasians, including BLK, IRF5, STAT4, TNFAIP3, TNFSF4 and the 
HLA region, but also discovered novel SLE risk loci. Two of the novel 
SLE genes, TNIP1 and IKZF1, were independently identified in 
Caucasian 7,11 and Asian 9 populations, suggesting that the majority 
of genetic risk factors for SLE are common across Caucasians and 
Asians. However, a higher proportion of SLE risk loci seem to be 
shared between different Asian populations compared with the loci 
they share with Caucasian populations. 12 In addition, higher 
incidence and prevalence of SLE, as well as differences in clinical 
manifestations have been reported for Asian populations. 13 These 
observations indicate potential genetic heterogeneity for SLE between 
Asian and Caucasian populations. 
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In this study, we investigated whether 10 single-nucleotide poly- 
morphisms (SNPs) located at eight SLE risk loci, including ETS1, 
IKZF1, LRRC18- WDFY4, RASGRP3, SLC15A4, TNIP1, 7qll.23 and 
16pll.2, which were originally identified by GWAS in Asian popula- 
tions, 9 also confer disease risk in a collection of Caucasian SLE 
patients and healthy controls from Sweden and Finland. We also 
present data for these loci from our previous study on Caucasians 
from the United States. 

MATERIALS AND METHODS 

Subjects 

The Swedish cohort included 1129 SLE patients and 2060 controls. Patients 
were recruited from the Rheumatology clinics at the Uppsala, Karolinska 
(Solna), Umea, Lund and Linkoping University Hospitals, and the controls 
were either healthy blood donors (Uppsala, Umea, Lund and Linkoping) or 
controls from the population-based Epidemiological Investigation of Rheu- 
matoid Arthritis cohort. 14 Detailed clinical characteristics were available for 
873 Swedish SLE patients (Supplementary Table SI). The Finnish cohort was 
composed of 270 SLE patients recruited from the Helsinki, Tampere and 
Kuopio University Hospitals and 343 controls from multiple regions in 
Finland. 15 ' 16 The US cohort, consisting of 1310 SLE patients and 7859 
controls has been previously described. 3,7 All participants included in the 
current study were of Caucasian origin based on self-report. All SLE patients 
fulfilled at least four of the classification criteria for SLE as defined by the 
American College of Rheumatology (ACR). 17 The study was approved by the 
ethics committees of all involved institutions. Informed consents were obtained 
from all SLE patients and controls. 

Genotyping 

Ten novel polymorphisms from a SLE GWAS in Han Chinese 9 were selected 
for genotyping. One additional SNP (rs7708392, TNIP1) was selected from a 
previous study on Caucasians. 7 Individuals from the Swedish and Finnish SLE 
case-control cohorts were both genotyped for the eleven SNPs with a 384-plex 
GoldenGate assay using VeraCode microbeads on the Ulumina BeadXpress 
system (Ulumina Inc, San Diego, CA, USA) at the SNP and SEQ technology 
platform in Uppsala, Sweden (www.genotyping.se). One SNP failed genotype 
clustering (rs4639966, llq23.3). Genotype data for the 10 remaining SNPs 
were extracted from the larger set of 384 SNPs, and the data for both cohorts 
were then quality controlled together. A genotyping success rate of 95-97% 
was achieved for all 10 SNPs. Call rates for the Ulumina GoldenGate assay will 
largely depend on the SNPs in the oligo pool. Our 384-plex SNP panel 
included a mix of novel (not previously validated) and common variants 
which could have contributed to the observed slightly lower call rates. In total 
106 individuals with more than two failed SNPs were excluded from 
subsequent analyses. All 10 SNPs were in Hardy-Weinberg equilibrium 
( P- value > 0.005) in the control individuals from both cohorts. For the US 
SLE case-control cohort, quality controlled genotype data for 8 out of the 10 
SNPs was available from a previous study by Gateva et aV The 10 SNPs 
genotyped and analyzed in the Swedish and Finnish cohorts in the current study 
were selected independently of what data that was available for the US cohort. 

Statistical analysis 

Quality control of the genotype data, Hardy-Weinberg equilibrium tests, and 
association analyses were conducted using the software PLINK. 18 We 
performed allelic association analysis using Fisher's exact tests by comparing 
the allele counts of SNPs in SLE patients and controls, in case numbers in any 
of the groups would be so small that the ^-approximation would be invalid. 
Odds ratios (ORs) were reported relative to the risk allele identified in Asians. 
Similarly, subphenotype case-only analyses were performed by comparing allele 
counts for the patients who fulfilled a specific ACR classification criterion for 
SLE to all the SLE patients who did not fulfill that specific criterion. 
Subphenotype case-control analyses were performed by comparing allele 
counts for the patients who fulfilled a specific ACR classification criterion 
for SLE to healthy controls. A P- value 0.05 was considered significant. 
Heterogeneity of allelic effects across cohorts was evaluated with Cochran's 



Q-test and the I 2 heterogeneity index. 19 Meta-analysis was carried out using 
the software GWAMA v2.0.7, 20 in which a fixed-effects model was used for 
SNPs with a P-value>0.05 in Cochran's Q-test and a random-effects model for 
SNPs with a P-value<0.05. For rs7708392 and rs4917014 meta-analysis was 
conducted based on the data from the Swedish and the Finnish cohorts, and 
for all other SNPs data from all three cohorts were included. Population 
attributable risk percentage (PAR%), which considers both the OR and risk 
allele frequency (RAF) in the general population, was estimated using the 
formula RAF(OR-l)/(RAF(OR-l) + l). 21 Statistical power was estimated using 
the software Quanto vl.2.4 (http://hydra.usc.edu/gxe/) with a log-additive 
model, a two-sided type I error rate of 0.05 and SLE prevalence of 0.068% in 
Sweden, 0.028% in Finland and 0.053% in the United States. 22 

RESULTS 

Allelic association analysis in three Caucasian SLE case-control 
cohorts 

Ten SNPs in nine novel SLE loci discovered in Asian populations, 9 but 
which had not yet been replicated in Caucasians, were selected for 
inclusion in the study. These loci were ETS1, IKZF1, LRRC18- 
WDFY4, RASGRP3, SLC15A4, TNIP1, 7qll.23, llq23.3 and 16pll.2. 
A SNP in TNIP1 (rs7708392) from our previous study on Caucasians 
published at the same time was also included, as this SNP differed 
from the one reported in Asians. 7 We genotyped and investigated the 
disease associations for these SNPs in two Caucasian SLE case-control 
cohorts from Sweden and Finland. Data for these loci were then 
extracted from our previous study on Caucasians from the United 
States. In the Swedish, Finnish, and US SLE case-control cohorts, 
consistent and significant association signals were detected for the 
SNPs rsl913517 and rs6590330, which are annotated to the gene loci 
LRRC18-WDFY4 and ETS1, respectively, (Table 1, Supplementary 
Table S2). The associations with SLE for the TNIP1 (encoding 
TNFAIP3-interacting protein 1) and SLC15A4 (solute-carrier family 
15, member 4) genes were supported by data from two cohorts: 
TNIP1 (rsl0036748) in the Swedish and US cohorts, and SLC15A4 
(rsl0847697 and rsl385374) in the Swedish and Finnish cohorts. In 
the remaining third cohort the association was in the same direction 
as in the other two but with marginally significant P-values. The SNPs 
tagging the RASGRP3 (RAS guanyl nucleotide- releasing protein 3) 
and 7qll.23 loci appeared to be associated with SLE in the Swedish 
cohort, but failed to replicate in the Finnish or the US cohort. IKZF1 
(Ikaros family zinc finger 1) was also exclusively associated with SLE 
in the Swedish cohort, but no SNP tagging this gene was tested in the 
US cohort. Additionally, the SNP rs7197475, which is located in the 
16pll.2 locus, showed a trend of association in the Swedish cohort 
only. The SNP rs4639966 in the llq23.3 locus failed genotyping in the 
Swedish and Finnish cohorts. In the Swedish and the US cohorts, we 
have > 58% power with a hypothesized OR of 1.2, and > 89% power 
with a hypothesized OR of 1.3 for the SNPs without nominally 
significant P-values; whereas the Finnish cohort is smaller in size and 
provides limited power (10-35% and 16-62% with a hypothesized 
OR of 1.2 and 1.3, respectively). 

In a meta-analysis significant P-values for the association with SLE 
were detected for all SNPs except rsl 167796 at the 7qll.23 locus 
(Table 1). In addition, the effects of all the associated alleles, either 
conferring risk for SLE or being protective against SLE, were in the 
same direction as reported in the Asian cohort. 

Comparisons of allele frequencies, effect sizes and risk across 
cohorts 

In order to more comprehensively evaluate the differences in disease 
association for the SLE risk loci between Asian and Caucasian 
populations, data from the Swedish, Finnish and US cohorts in the 
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current study were analyzed as one combined Caucasian cohort. In 
addition, data on the TNIP1 SNP rs6889239, which is in perfect LD 
with rs 10036748 tested here, from a published UK SLE case-control 
study was also included in this Caucasian cohort. 11 Analogously, 
summary data were retrieved from three published Chinese SLE 
association studies 9 ' 10 ' 23 and two published Japanese SLE association 
studies, 12 ' 24 and summarized into one Asian cohort. 

The RAFs of all 10 SNPs in control individuals differed significantly 
between the combined data sets of Caucasians and Asians (Fisher's 
exact test P-value< 10" 10 , Table 2). Especially for the TNIP1 SNPs 
rsl0036748 and rs7708392, and rsl913517 in LRRC1 8- WDFY4, their 
minor alleles in Caucasians were the major alleles in Asians (Figure 1). 
Despite the clear difference in allele frequencies for the SNPs between 
Caucasians and Asians, their effect sizes, which were estimated by ORs 
derived from meta-analysis, were not as distinct across populations 
(Table 2, Figure 1). The effect sizes of the SNPs rs6590330 and 
rs7197475 were significantiy larger in the Asian population (Cochran's 
Q-test P-value<0.05, 7 2 ^0.84), and moderate heterogeneity of allelic 
effect (0.49 </ 2 ^ 0.61) was detected for the SNPs rsl3385731, 
rsl0036748, rs4917014, rsll67796 and rsl913517. The contributions 
of all ten SNPs to the risk of SLE were estimated with PAR%, which 
considers both RAF and effect size. Higher PAR% values were 
detected for most of the SNPs in the Asian population (Table 2). 
Notably the PAR% value of the ETS1 SNP rs6590330 was about five 
times higher in Asians than in Caucasians. For the SNPs rs7708392, 
rsl0036748 and rsll67796, located in the TNIP1 and 7qll.23 loci, 
their risk contributions in Asians were more than double the size than 
in Caucasians. 

The RAFs and effect sizes for the 10 SNPs were also compared 
across the Swedish, Finnish and US SLE case-control cohorts 
(Figure 1, Supplementary Table S2, Supplementary Table S3). Most 
of the SNPs showed some differences in RAFs across the three 
cohorts, although the differences were not as pronounced as 
compared with the Asian cohort. In contrast, the effect sizes of all 
SNPs were comparable in different Caucasian cohorts. Only margin- 
ally significant differences were detected for the SLC15A4 and the 
7qll.23 loci, which seemed to confer a stronger risk in the Swedish 
cohort than in the US cohort. PAR% values across different Caucasian 
cohorts were also variable, which was mainly a consequence of the 
differences in allele frequencies. 



Association analysis within SLE patients with different disease 
manifestations 

Considering that the genetic risk factors for SLE influence different 
cellular pathways, their contribution to disease pathogenesis may 
differ across disease manifestations. Therefore, we performed case- 
only association analysis in subgroups of Swedish SLE patients with 
ACR classification data available (Supplementary Tables SI, S4). 
Significant association signals were detected for the genes TNIP1 
and SLC15A4 in the SLE patients with renal disorder (ACR7, 
Supplementary Table S4). Together with ETS1, TNIP1 seemed to also 
be associated with the immunological disorder criterion (ACR10). In 
addition, RASGRP3 showed association signals in the SLE patients 
with neurological disorder (ACR8), and 16pll.2 was associated with 
photosensitivity (ACR3) as well as serositis (ACR6). However, only 
the association for SLC15A4 in the SLE patients with renal disorder 
(ACR7) remained significant after multiple testing correction. We also 
performed a subphenotype case-control analysis, where we found 
TNIP1 and SLC15A4 to be associated with 8 of the 11 SLE subgroups 
after applying multiple testing correction (Supplementary Table S5). 
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Table 2 Comparisons of RAFs, effect sizes and PAR% between Caucasians and Asians 
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Abbreviations: 


square, heterogeneity 


ndex; OR, odds ratio; PAR%, population attributable risk percenta 


ge; Q net , Cochran's Q-test; RAF, risk allele frequency; SNP, single-nucleotide polymorphism. 



a Two-tailed P-value calculated by comparing the counts of risk alleles in the combined Caucasian and combined Asian data sets of controls with Fisher's exact test. 
b P-value calculated with Cochran's Q-test which compares the allelic effects based on OR values estimated by meta-analysis. 
c The heterogeneity index i 2 quantifies the proportion of variation in allelic effects based on OR values estimated by meta-analysis. 
d PAR% calculated using the formula RAF(0R-1)/(RAF[0R-1] + l). 21 
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Figure 1 Comparisons of odds ratios (ORs) and allele frequencies across cohorts. The median of ORs for individual cohorts are plotted with open symbols in 
gray, and the counterparts estimated from meta-analysis in Caucasian and Asian populations are plotted with solid symbols in black. ORs are calculated 
based on the risk alleles of SNPs reported in Asian populations. Error bars represent 95% confidence intervals (CIs). The pie charts at the bottom panel 
illustrate the allele frequencies of SNPs in patients and controls. Charts for the combined data sets of Caucasians and Asians are in a darker shade. 



DISCUSSION 

By investigating 10 SNPs in three independent Caucasian SLE case- 
control cohorts, we replicated the association signals from 9 out of the 
10 SLE SNPs originally identified by GWASs in Asian SLE case- 
control cohorts. Applying the more stringent Bonferroni correction, 
five of these would still be significant. With the current study design, 
the statistical power for all the significant association signals was 
>65% in the individual Caucasian cohorts and >99% for the 
meta-analysis. Of the associated genes, both LRRC18 (leucine-rich 
repeat-containing protein 18) and WDFY4 (WD repeat and 
FYVE domain-containing protein 4) are of unknown function, 
whereas ETS1 encodes an ETS transcription factor involved in B-cell 
development, mutation of which leads to autoimmune disease in 
mice. 25 TNIP1 is a regulatory protein that inhibits signaling by 



several receptors, including TNF-alpha, TLR and PPAR. 26 SLC15A4 
is highly expressed in antigen-presenting cells, and has been 
shown to regulate TLR9- and NODl-dependent innate immunity 
responses. 25 The IKZF1 transcription factor is a key regulator within 
the hematopoietic system and is involved in the regulation of 
STAT4. 26 

Despite the differences in allele frequencies of these SNPs between 
the Caucasian and Asian populations, the risk alleles of all associated 
SNPs were shared by Caucasians and Asians. In addition, most of the 
associated SNPs showed comparable effect sizes across ethnic groups. 
This is also true for the majority of the previously reported SLE risk 
loci from GWASs for both Caucasians and Asians, such as BLK, IRF5, 
STAT4, TNFAIP3, TNFSF4 and the HLA region. These SLE GWASs 
have generally been performed using a similar number of cases and 
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controls for both ethnicities, thus differences in the power to detect 
significant associations for individuals SNPs across ethnicities appears 
to largely depend on their allele frequencies. These observations 
underscore the overall genetic homogeneity of SLE. 

However, there are differences: for instance, the SNP rs6590330 in 
the ETS1 locus shows distinct allele frequencies and effect sizes in 
Caucasians and Asians. Its contribution to the risk of SLE is more 
pronounced in Asians although it is a disease-associated SNP in both 
populations. The SNP rs7197475 in the 16pll.2 locus has a low RAF 
and a large effect in Asians, but it is common in Caucasian 
populations with a relatively weak contribution to the risk of SLE. 
Generally the differences in SNP allele frequencies seem to be the 
major source of genetic heterogeneity rather than the differences in 
effect sizes. These differences in allele frequencies are most likely a 
result of genetic drift, but could possibly also be caused by differences 
in selection due to environmental factors. The present study was 
performed on 10 Asian SLE SNPs that at the time of initiation of our 
study were identified as novel compared with the previous GWASs on 
Caucasians with SLE. Since then, several more studies have been 
initiated in Asian, which have identified additional loci for SLE. 10 ' 12 
Therefore, a next step will be to perform a more comprehensive 
independent analysis for population heterogeneity of all published 
variants associated with SLE in Caucasians and Asians. 

In addition, for some of the SNPs we did not detect consistent 
association signals from all three Caucasian cohorts, which may be 
due to the differences in RAFs suggested by variable PAR% across the 
three Caucasian cohorts. There is always an inherent risk in 
comparing genotype data from studies performed at different centers. 
The genotypes for the Swedish and Finnish cohorts were quality 
controlled together, and the US cohort had previously been exten- 
sively QCd as outlined in Gateva et al, 7 we were not able to apply the 
same extensive QC to the Swedish and Finnish cohorts as we do not 
have genome-wide SNP data for these. However, as the genotyping 
assays used in all studies analyzed here are well validated, and our 
study focuses on common variation we believe the risk for genotyping 
errors to be minimal. Thus, we believe that the small differences in 
allele frequencies we observe between the Caucasian populations are 
real and not artefacts. Statistical power may be another factor that 
influences our observations. In the Swedish and the US cohorts, we 
have reasonable power, whereas the Finnish cohort is smaller in size 
and provides limited power for any robust conclusion of association 
for the SNPs without nominally significant P-values. 

A notable finding in our study was that the TNIP1 gene was 
nominally associated with SLE in patients with renal or immunolo- 
gical disorder, both in the case-only and case-control subgroup 
analyses. The association of TNIP1 in the SLE patients with renal and 
immunological disorder has also been reported in two independent 
studies on Chinese and Japanese populations. 23 ' 24 Epidemiological 
studies have observed higher rates of renal involvement in SLE 
patients of Asian origin, 27 which may corroborate our finding that the 
contribution of TNIP1 to the disease association was 2.1-2.6 times 
higher in the Asian cohort than the Caucasian cohort. In addition to 
SLE, association of TNIP1 with psoriasis has been reported for both 
Caucasian and Asian populations. 28 ' 29 However, the SNPs reported 
for psoriasis are only in weak LD with the two TNIP1 SNPs studied 
here (r^O.ll in Caucasians and r 2 ^ 0. 18 in Asians, HapMap release 
no. 28). TNIP1 has also been found as a risk factor for systemic 
sclerosis (SSc) in a GWAS based on Caucasian populations. 30 
Interestingly, the TNIP1 SNPs associated with SSc are in medium 
LD in Caucasians (Cl.13^^^0.58) but in weak LD in Asians 
(r^O.lO) with the two SLE-associated SNPs in this study. Further 



studies in patients with SLE and other autoimmune diseases of 
different ethnical origins and with well-defined clinical characteristics 
will help to clarify the contribution of TNIP1 to the differences in 
disease manifestations across populations. The association for 
SLC15A4 and renal involvement was the only signal to pass 
multiple testing correction in both the case-only and case-control 
subphenotype analyses. This has to our knowledge not previously 
been reported in Caucasians or Asians. 

In addition to genetic differences across ethnicities, variation in 
environment, culture, socioeconomic status and methodological 
factors could also underlie differences in disease prevalence and 
manifestations between ethnic groups. 27 ' 31 Therefore, a multi-ethnic 
study design is superior to studies based on a single ethnic group, 
because in the former design the influence of potential non-genetic 
factors are neutralized, which makes the study more robust in 
dissecting the impact of genetic factors in complex diseases such as 
SLE. Here, we have studied common SLE risk variants, which are 
likely to be shared between different populations; it remains to be 
seen if rare alleles contribute to risk of SLE in multiple populations in 
a similar manner. 
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